# Industrial Image Analysis
InternVL3 38B Instruct GGUF
Apache-2.0
InternVL3-38B-Instruct is an advanced multimodal large language model (MLLM) that demonstrates exceptional overall performance, with strong multimodal perception and reasoning capabilities.
Image-to-Text
Transformers

unsloth
1,236
2
InternVL3 8B Instruct GGUF
Apache-2.0
InternVL3-8B-Instruct is an advanced multimodal large language model (MLLM) that demonstrates exceptional overall performance, with strong multimodal perception and reasoning capabilities.
Image-to-Text
Transformers

unsloth
2,412
1
InternVL3 8B GGUF
Apache-2.0
InternVL3 is an advanced multimodal large language model series, demonstrating exceptional overall performance with robust multimodal perception and reasoning capabilities.
Image-to-Text
Transformers

unsloth
4,810
3
InternVL3 78B HF
Other
InternVL3 is an advanced multimodal large language model series with powerful multimodal perception and reasoning capabilities, supporting image, video, and text inputs.
Image-to-Text
Transformers Other

OpenGVLab
40
1
InternVL3 1B AWQ
Other
InternVL3-1B is a multimodal large language model in the InternVL3 series, featuring exceptional multimodal perception and reasoning capabilities.
Image-to-Text
Transformers Other

OpenGVLab
303
1
InternVL3 2B Instruct
Apache-2.0
InternVL3-2B-Instruct is the supervised fine-tuned (SFT) version of InternVL3-2B. It has undergone native multimodal pretraining followed by SFT and offers powerful multimodal perception and reasoning capabilities.
Image-to-Text
Transformers Other

OpenGVLab
1,345
4
InternVL3 1B Instruct
Apache-2.0
InternVL3-1B-Instruct is a supervised fine-tuned model in the InternVL3 series, built on native multimodal pretraining, with exceptional multimodal perception and reasoning capabilities.
Image-to-Text
Transformers Other

OpenGVLab
705
5
InternVL3 78B Instruct
Other
InternVL3-78B-Instruct is an advanced multimodal large language model developed by OpenGVLab. It demonstrates exceptional multimodal perception and reasoning capabilities and supports tasks such as tool use, GUI agents, industrial image analysis, and 3D visual perception.
Image-to-Text
Transformers Other

OpenGVLab
345
5